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NUCLEIC ACIDS ENCODING TRANSFERRIN RECEPTOR-LIKE PROTEINS 
AND PRODUCTS RELATED THERETO 



This invention was made with support by Grant No. CA26038-20 awarded by the 
National Institutes of Health. 

RELATED APPLICATION 

This application claims the benefit of a provisional application serial No. 
60/107,502, filed on November 6, 1998 which is hereby incorporated by reference. 

RACKGRGUND OF THE INVENTION 
Area of the Art 

The invention relates generally to the transferrin receptor family and specifically to 
nucleic acid encoding transferrin receptor-like proteins, and products related thereto. 
Description of the Prior Art 

Throughout this application, various references are referred to within parentheses. 
Disclosures of these publications in their entireties are hereby incorporated by reference into 
this application to more fiilly describe the state of the art to which this invention pertains. 
Full bibHographic citation for these references may be found at the end of this application, 
preceding the claims. In addition, the abbreviations used are: TfR, transferrin receptor; RT- 
PCR, reverse transcriptase-polymerase chain reaction; Tf, transferrin; PSMA, prostate 
specific membrane antigen; RACE, rapid amplification of cDNA ends; G3PDH, 
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glyceraldehyde-3-phosphate dehydrogenase; UTR, untranslated region; IRE, iron - 
responsive element; IRP, iron regulatory protein. 

Transferrin receptor (TfR) is a key molecule involved in iron uptake by cells (1, 2). 
On the cell membrane the TfR homodimer binds to two diferric transferrin (Tf) molecules, 
5 resulting in internalization of the complex. In the cytoplasm, iron is released and utilized as 
a co-factor by several proteins, mcluding heme, aconitase, cytochromes (3) and 
ribonucleotide reductase (4), or it may be stored in ferritin molecules. Since dividing cells 
require more iron than non-dividing cells, the expression of TfR is usually higher in rapidly 
dividing tissue (5), such as hematopoietic progenitor cells (6). Also, TfR expression is 

1 0 higher in tumor cells when compared to their normal cellular counterparts (7). The affinity 
of diferric Tf to TfR is modulated by HFE (8, 9). 

The only other known homolog of TfR is PSMA, a human homolog of murine 
NAAG-peptidase (10, 1 1). Since the expression of PSMA is high in prostate cancer, the 
antibody against PSMA was approved for use as an imaging agent to detect metastasis of 

1 5 prostate cancer (1 2). The fimction of PSMA appears to be considerably different from that 
of TfR, despite the modest similarity between their extracellular domains. PSMA does not 
mediate endocytosis, and possesses glutamyl-carboxypeptidase activity (11, 13). 

Given the importance of a transferrin receptor in an iron uptake process of cells, it is 
desirable to identify potential molecules which are homologous to a transferrin receptor and 

20 which perform transferrin receptor-like fimctions. The identification of the molecules may 
provide valuable tools for altering the iron uptake of specific cells. In addition, identified 
novel receptors may be used to identify various new ligand that have activity with other 
metals or other key proteins that are vital for the cells. Furthermore, since TfR expression is 
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higher in tumor cells, the newly identified receptors may be used for diagnosing or treating 
tumor cells. 
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SUMMARY OF THE INVENTION 

It is an object of the present invention to identify the potential transferrin receptor- 
Hke proteins. It is also an object of the present invention to investigate the roles of newly 
discovered receptors in iron metabolism in cells. It is a further object of the present 
5 invention to provide methods for diagnosing tumor cells. 

Accordingly, the present invention provides isolated nucleic acids encoding novel 
transferrin receptor-like (TfR2) polypeptides, or fragments thereof, and isolated TfR2 
polypeptides encoded thereby. Further provided are vectors containing nucleic acids of the 
present invention, host cells transformed therewith, antisense oligonucleotides thereto and 
1 0 compositions containing antibodies that specifically bind to polypeptides of the present 
invention. Methods of detecting TfR2 in a cell are also provided. 

The invention is defined in its fullest scope in the appended claims and is described 
below in its preferred embodiments. 
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DESCRIPTION OF THE FIGURES 

The above-mentioned and other features of this invention and the manner of 
obtaining them will become more apparent and will be best understood by reference to the 
following description, taken in conjunction with the accompanying drawings. These 
drawings depict only a typical embodiment of the invention and do not therefore limit its 
scope. They serve to add specificity and detail, in which: 

FIG. 1 is a gene map of the transferrin receptor 2 (TfR2) gene. 

FIG. 2 shows DNA sequences of exons 3-5 of TfR2 gene. Boxed sequences were 
found only in the p transcript. 

FIG. 3 shows deduced amino acid sequence of TflR2-a, aligned with those for the 
human TfR and PSMA proteins. 

FIGS. 4A and 4B show the results of Northern blot analysis on multiple tissue blots 
of human mRNA (A), and cell line blots of total RNA (B). 

FIGS. 5 A and 5B show the representative results of RT-PCR analyses performed 
with primers for a and p transcripts of TfR2 (35 cycles) as well as G3PDH (27 cycles). 

FIGS. 6A, 6B and 6C show the expression and functional analysis of TfR2-a protein. 

FIG. 7 is the amino acid sequence of TfR2 protein SEQ ID N0:1. 

FIG. 8 is the DNA sequence of TfR2-a gene SEQ ID NO:2. 

FIG. 9 is the DNA sequence of TfR2-p gene SEQ ID N0:3. 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention is based on the discovery and the cloning of a human gene 
homologous to transferrin receptor (TfR). For the purpose of the present invention, this gene 
is termed TfR2 gene. TfR2 gene is cloned, sequenced and mapped to chromosome 7q22. 
Two transcripts expressed from this gene are identified; they are a (about 2.9kb) and p 
(about 2.5 kb) transcripts. The deduced amino acid sequences from each transcript predict 
the possible expression of both a membrane bound and an intracellular form of the TfR2 
protein. The deduced amino acid sequence of TfR2-a protein is a type II membrane protein, 
and shares 45% identity and 66 % similarity in its extracellular domain with TfR. The TfR2- 
P protein lacked the amino terminal protein of the TflR2-a protein including the putative 
transmembrane domain. TfR deficient cells transfected with FLAG-tagged TfR2-a showed 
an increase of biotinylated transferrin (Tf) binding to the cell surface. In addition, these 
transfected cells have a marked increase of Tf-bound ^Ve uptake. 

Accordingly, the present invention provides isolated nucleic acids encoding a TfR2 
polypeptide. Such nucleic acids can be obtained, for example, from human chromosome 
7q22. Deletion or loss of heterozygosity of this chromosomal region has been reported in 
several malignant diseases including myelodysplastic syndromes, acute myeloid leukemia, 
as well as breast, ovarian and pancreatic cancers. The nucleic acids may also be obtained 
from a human cDNA library such as, but not limited to, HL60 cDNA library or TF-1 cDNA 
library. 

The term "nucleic acids" (also referred to as'polynucleotides) refers to a polymer of 
deoxyribonucleotides or ribonucleotides, in the form of a separate fragment or as a 
component of a larger construction. DNA encoding the polypeptide of the invention can be 
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assembled from cDNA fragments or from oligonucleotides which provide a synthetic gene 
which is capable of being expressed in a recombinant transcriptional unit. Polynucleotide 
sequences of the invention include DNA, RNA and cDNA sequences. In accordance with 
one embodiment of the present invention, nucleic acids encode a polypeptide having the 
5 amino acid sequence set forth in SEQ ID N0:1 (see Fig. 7). In accordance with another 
embodiment of the present invention, nucleic acids may include, but are not limited to, 
nucleic acids having substantially the same nucleotide sequence as nucleotides set forth in 
SEQ ID NO: 2 or SEQ ID N0:3 (Fig. 8 and Fig. 9, respectively). In accordance with a 
preferred embodiment, the nucleic acids of the present invention include the same nucleotide 

1 0 sequences as set forth in the SEQ ID NO:2 or 3. As used herein, the phrase "substantially 
the same nucleotide sequence" refers to DNA having sufficient homology to the reference 
polynucleotide, such that it will hybridize to the reference nucleotide under typical 
moderate stringency conditions, DNA having "substantially the same nucleotide sequence" 
as the reference nucleotide sequence has at least 60% homology with respect to the 

1 5 reference nucleotide sequence. 

As used herein, the phrase "isolated" means a nucleic acid that is in a form that does 
not occur in nature. DNA sequences of the invention can be obtained by several methods. 
For example, the DNA can be isolated using hybridization techniques which are well known 
in the art. These include, but are not limited to: 1) hybridization of genomic or cDNA 

20 libraries with probes to detect homologous nucleotide sequences, 2) polymerase chain 

reaction (PGR) on genomic DNA or cDNA using primers capable of annealing to the DNA 
sequence of interest, and 3) antibody screening of expression libraries to detect cloned DNA 
fragments with shared structural features. 
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Preferably the polynucleotide of the invention is derived jfrom a mammalian 
organism, and most preferably from a mouse, rat, or human. Screening procedures which 
rely on nucleic acid hybridization make it possible to isolate any gene sequence from any 
organism, provided the appropriate probe is available. Oligonucleotide probes, which 
5 correspond to a part of the sequence encoding the protein in question, can be synthesized 
chemically. This requires that short, oligopeptide stretches of an amino acid sequence must 
be known. The DNA sequence encoding the protein can be deduced from the genetic code, 
however, the degeneracy of the code must be taken into account. It is possible to perform a 
mixed addition reaction when the sequence is degenerate. This includes a heterogeneous 

1 0 mixture of denatured double-stranded DNA. For such screening, hybridization is preferably 
performed on either single-stranded DNA or denatured double-stranded DNA. 
Hybridization is particularly useful in the detection of cDNA clones derived from sources 
where an extremely low amoxmt of mRNA sequences relating to the polypeptide of interest 
are present. In other words, by using stringent hybridization conditions directed to avoid 

1 5 non-specific binding, it is possible, for example, to allow the autoradiographic visualization 
of a specific cDNA clone by the hybridization of the target DNA to that single probe in the 
mixture which is its complete complement (Wallace, et al., Nucl Acid 9:879, 1981). 

The specific DNA sequences of the present invention can also be obtained by: 1) 
isolation of double-stranded DNA sequences from the genomic DNA; 2) chemical 

20 manufacture of a DNA sequence to provide the necessary codons for the polypeptide of 
interest; and 3) in vitro synthesis of a double-stranded DNA sequence by reverse 
transcription of mRNA isolated from an eukaryotic donor cell. In the latter case, a double- 
stranded DNA sequence by reverse transcription of mRNA isolated from an eukaryotic 
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donor cell. In the latter case, a double-stranded DNA complement of mRNA is eventually 
formed which is generally referred to as cDNA. Of the three above-noted methods for 
developing specific DNA sequences for use in recombinant procedures, the isolation of 
genomic DNA isolates is the least common. This is especially true when it is desirable to 
5 obtain the microbial expression of mammalian polypeptides due to the presence of introns. 
The synthesis of DNA sequences is frequently the method of choice when the entire 
sequence of amino acid residues of the desired polypeptide product is known. When the 
entire sequence of amino acid residues of the desired polypeptide is not known, the direct 
synthesis of DNA sequences is not possible and the method of choice is the synthesis of 

1 0 cDNA sequences. Among the standard procedures for isolating cDNA sequences of interest 
is the formation of plasmid or phage-carrying cDNA libraries which are derived from 
reverse transcription of mRNA, which is abundant in donor cells that have a high level of 
genetic expression. When used in combination with polymerase chain reaction technology, 
even rare expression products can be cloned. In those cases where significant portions of 

1 5 the amino acid sequence of the polypeptide are known, the production of labeled single or 
double-stranded DNA or RNA probe sequences duplicating a sequence putatively present in 
the target cDNA may be employed in DNA/DNA hybridization procedures which are 
carried out on cloned copies of the cDNA which have been denatured into a single-stranded 
form (Jay, et al., Nucl Acid Res., 11:2325, 1983). 

20 DNA sequences encoding TfR2 polypeptides can be expressed in vitro by DNA 

transfer into a suitable host cell. "Host cells" are cells in which a vector can be propagated 
and its DNA expressed. The term also includes any progeny of the subject host cell. It is 
understood that all progeny may not be identical to the parental cell, since there may be 
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mutations that occur during replication. However, such progeny are included when the term 
"host cell" is used. Methods of stable transfer, meaning that the foreign DNA is 
continuously maintained in the host, are known in the art. 

In the present invention, the polynucleotide sequences may be inserted into a 
5 recombinant expression vector. The term "recombinant expression vector" refers to a 
plasmid, virus or other vehicle known in the art that has been manipulated by insertion of 
incorporation of the TfR2 genetic sequences. Such expression vectors contain a promoter 
sequence which facilitates the efficient transcription of the inserted genetic sequence of the 
host. The expression vector typically contains an origin of replication, a promoter, as well as 

1 0 specific genes which allow phenotypic selection of the transformed cells. Vectors suitable 
for use in the present invention include, but are not limited to, the T7-based expression vector 
for expression in bacteria (Rosenberg, et al.. Gene, 56:125, 1987), the pMSXND expression 
vector for expression in mammalian cells (Lee and Nathans, X Biol Chem., 263:3521, 1988) 
and baculovirus-derived vectors for expression in insect cells. The DNA segment can be 

1 5 present in the vector operably linked to regulatory elements, for example, a promoter (e.g., 
T7, metallothionein I, or polyhedrin promoters). 

Polynucleotide sequences encoding TfR2 can be expressed in either prokaryotes or 
eukaryotes. Hosts can include microbial, yeast, insect and mammalian organisms. Methods 
of expressing DNA sequences having eukaryotic or viral sequences in prokaryotes are well 

20 known in the art. Biologically functional viral and plasmid DNA vectors capable of 
expression and replication in a host are known in the art. Such vectors are used to 
incorporate DNA sequences of the invention. 
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Transformation of a host cell with recombinant DNA may be carried out by 
conventional techniques that are well known to those skilled in the art. Where the host is 
prokaryotic, such as E. coli, competent cells which are capable of DNA uptake can be 
prepared from cells harvested after an exponential growth phase and subsequently treated by 
5 the CaChmethod, using procedures well known in the art. Alternatively, MgCl 2 or RBCl 
can be used. Transformation can also be performed after forming a protoplast of the host 
cell, if desired. 

When the host is a eukaryote, such methods of transfection of DNA as calcium 
phosphate co-precipitates, conventional mechanical procedures such as microinjection, 

1 0 electroporation, insertion of a plasmid encased in liposomes, or virus vectors may be used. 
Eukaryotic cells can also be co-transformed with DNA sequences encoding TfR2 of the 
invention, and a second foreign DNA molecule encoding a selectable phenotype, such as the 
herpes simplex thymidine kinase gene. Another method is to use a eukaryotic viral vector, 
such as simian virus 40 (SV40) or bovine papilloma virus, to transiently infect or transform 

1 5 eukaryotic cells and express the protein. (See, for example, Eukaryotic Viral Vectors, Cold 
Spring Harbor Laboratory, Gluzman ed.k 1982). 

Isolation and purification of microbial-expressed polypeptides, or fragments thereof, 
provided by the invention, may be carried out by conventional means including preparative 
chromatography and immunological separations involving monoclonal or polyclonal 

20 antibodies. 

Another aspect of the present invention provides isolated TfR2 polypeptide, or 
fragments thereof, and ftmctional equivalents thereof As used herein, the term "isolated" 
means a protein molecule, free of cellular components, and/or contaminants normally 
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associated with a native in vivo environment. The polypeptides of the present invention 
include any isolated naturally occurring allelic variant, as well as recombinant forms thereof 

Minor modifications of the primary amino acid of the peptide of the present 
invention may result in peptides which have substantially equivalent activity as compared 
5 with the specific peptide described herein. Such modifications may be deliberate, as by site- 
directed mutagenesis, or may be spontaneous. Modification may also be made to the length 
of the peptide of the present invention. It is recognized by those skilled in the art that it is 
possible that a peptide which is longer or shorter than the peptide of the present invention 
may still preserve substantially the same biological function of the peptide of the present 

1 0 invention. All of the peptides produced by these modifications are included herein as long 
as the biological activity of the peptides still exists. 

The polypeptide of the present invention can be isolated using various methods well 
known to a person of skill in the art. The methods available for the isolation and 
purification of the polypeptides of the present invention include precipitation, gel filtration, 

1 5 ion-exchange, reverse-phase and affinity chromatography. Other well-known methods are 
described in Deutscher et al.. Guide to Protein Purification: Methods in Enzymology, Vol 
182 (Academic Press, (1990)), which is incorporated herein by reference. Alternatively, the 
isolated polypeptides of the present invention can be obtained using well-known 
recombinant methods as described, for example, in the Examples. 

20 An example of the means for preparing the invention polypeptides is to express 

nucleic acids encoding the TfR2 in a suitable host cell as described above. Polypeptides of 
the present invention can be isolated directly from cells that have been transformed with 
expression vectors. The polypeptide, biologically active fragments, and functional 
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equivalents thereof can also be produced by chemical synthesis. For example, synthetic 
polypeptides can be produced using Applied Biosystems, Inc. Model 430A or 431 A 
automatic peptide synthesizer (Foster City, CA), employing the chemistry provided by the 
manufacturer. 

5 As used herein, the phrase "TfR2" refers to substantially pure native TfR2 proteins, 

or recombinantly expressed/produced proteins, including variants thereof encoded by 
mRNA, and generated by alternative splicing of a primary transcript, and farther including 
fragments thereof which retain native biological activity. Preferred polypeptides of the 
present invention are those that contain substantially the same amino acid sequence set forth 

10 in SEQ ID N0:1 (Figure 7). In accordance with one embodiment of the present invention, 
the isolated TfR2 polypeptide of the present invention is encoded by at least nucleotides set 
forth in SEQ ID NO:2 or SEQ ID NO:3 (See, Fig. 8 and Fig. 9, respectively). In accordance 
with one embodiment of the present invention, the sizes of the FLAG-tagged TfR2-a 
proteins are about 105 kDa in reducing condition, and about 215 kDa in non-reducing 

1 5 condition. The polypeptides of the present invention may be used to isolate ligands for 
transferrin receptors. 

A further aspect of the present invention provides antibodies which are 
immunoreactive or bind to the peptides of the present invention. Antibodies which consist 
essentially of pooled monoclonal antibodies with different epitopic specificities, as well as 

20 distinct monoclonal antibody preparations, are provided. Monoclonal antibodies are made 
fi^om antigen-containing peptides of the present invention or fragments by methods well 
known in the art (Kohler, et al., Nature^ 256:495, 1975; Current Protocols in Molecular 
Biology, Ausubel et al, ed., 1989). 
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Antibodies which bind to the peptides of the present invention or a region of TfR2 
represented by the peptides of the present invention can be prepared using an intact 
polypeptide or fragments containing peptides of interest as the immunizing antigen. A 
polypeptide or a peptide, such as Sequence ID No.l, used to immunize an animal can be 
5 derived from translated cDNA or chemical synthesis and is purified and conjugated to a 
carrier protein, if desired. Such commonly used carriers which are chemically coupled to 
the peptide include keyhole limpet hemocyanin (KLH), thyroglobulin, bovine serum 
albumin (BSA), and tetanus toxoid. The coupled peptide is then used to immunize the 
animal (e.g., a mouse, a rat, or a rabbit). 
10 If desired, polyclonal antibodies can be further purified, for example, by binding to 

u J and eluting from a matrix to which a polypeptide or a peptide to which the antibodies were 

HI raised is bound. Those of skill in the art will know of various techniques common in the 

jll immunology arts for purification and^r concentration of polyclonal antibodies, as well as 

monoclonal antibodies. (See, for example, Coligan, et al., Unit 9, Current Protocols In 

-.Ml? 

.J! 1 5 Immunology , Wiley Interscience, 1 99 1 , incorporated by reference.) 

If^ The term "antibody" as used in this invention includes intact molecules as well as 

fragments thereof, such as Fab, Fab'2 and Fv, which are capable of binding the epitopic 
determinant. These antibody fragments retain some ability to selectively bind with their 
antigen or receptor and are defined as follows: 
20 (1) Fab, the fragment which contains a monovalent antigen-binding fragment of 

an antibody molecule, can be produced by digestion of a whole antibody with the enzyme 
papain to yield an intact light chain and a portion of one heavy chain; 
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(2) Fab'2, the fragment of an antibody molecule, can be obtained by treating a 
whole antibody with pepsin, followed by reduction, to yield an intact light chain and a 
portion of the heavy chain; two Fab' fragments are obtained per antibody molecule; 

(3) Fab'2, the fragment of the antibody that can be obtained by treating the 

5 whole antibody with the enzyme pepsin without subsequent reduction; Fab '2 is a dimmer of 
two Fab' fragments held together by two disulfide bonds; 

(4) Fv, defined as a genetically engineered fragment containing the variable 
region of the light chain and the variable region of the heavy chain expressed as two chains; 
and 

1 0 (5) Single chain antibody ("SCA"), defined as a genetically engineered molecule 

containing the variable region of the light chain, the variable region of the heavy chain, 
linked by a suitable polypeptide linker as a genetically fiised single chain molecule. 

Methods of making these fragments are known in the art. (See, for example, Harlow 
and Lane, Antibodies: A Laboratorv Manual Cold Spring Harbor Laboratory, New York 
1 5 (1988), incorporated herein by reference.) 

As used in this invention, the term "determinant" means any antigenic determinant 
on an antigen to which the paratope of an antibody binds. Epitopic determinants usually 
consist of chemically active sxirface groupings of molecules, such as amino acids or sugar 
side chains, and usually have specific three- dimensional structural characteristics, as well as 
20 specific charge characteristics. 

It is also possible to use the anti-idiotype technology to produce monoclonal 
antibodies which mimic an epitope. For example, an anti-idiotypic monoclonal antibody 
made to a first monoclonal antibody will have a binding domain in the hypervariable region 
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which is the "image' of the epitope boimd by the first monoclonal antibody. Thus, in the 
present invention, an anti-idiotype antibody produced from an antibody which binds to, for 
example, the synthetic peptide of Sequence ID NO.l, can act as a competitive inhibitor for 
site on TfR2 which is required for iron metabolism in cells. 
5 The antibodies of the present invention can be used to isolate the polypeptides of the 

present invention. Additionally, the antibodies are useful for detecting the presence of 
polypeptides of the present invention, as well as analysis of chromosome localization, and 
structural as well as functional domains. 

Accordingly, another aspect of the present invention provides methods for detecting 

1 0 the presence of polypeptides of the present invention on the surface of a cell. The method 
comprises contacting the cell with an antibody that specifically binds to TfR2 polypeptides, 
under conditions permitting binding of the antibody to the polypeptides, detecting the 
presence of the antibody bound to the cell, and thereby detecting the presence of TfR2 
polypeptides of the present invention on the surface of the cell. With respect to the detection 

15 of such polypeptides, the antibodies can be used for in vitro diagnostics or in vivo imaging 
methods. 

Immunological procedvires useful for in vitro detection of target TfR2 polypeptides 
in a sample include immunoassays that employ a detectable antibody; such immunoassays 
include, for example, ELISA, Pandex microfluorimetric assay, agglutination assays, flow 
20 cytometry, serum diagnostic assays and immunohistochemical staining procedures which 
are well known in the art. An antibody can be made detectable by various means well- 
known in the art. For example, a detectable marker can be directly or indirectly attached to 
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the antibody; useful markers include, for example, radionucleotides, enzymes, fluorogens, 
chromogens and chemiluminescent labels. 

Furthermore, the antibodies of the present invention can be used to modulate the 
activity of the TfR2 polypeptide in living animals, in humans, or in biological tissues or 
5 fluids isolated therefrom. Accordingly, the present invention provides compositions 

comprising a carrier and an amount of an antibody having specificity for TflR2 polypeptides 
effective to block binding of naturally occurring ligands to TfR2 polypeptides. 

Another aspect of the present invention provides an antisense oligonucleotide capable 
of specifically binding to any portion of an mRNA that encodes TfR2 polypeptides so as to 

1 0 prevent or inhibit translation of the mRNA and inhibiting the translation of mRNA of TfR2 
polypeptides. The antisense oligonucleotide may have a sequence capable of binding 
specifically with any portion of the sequence of the cDNA encoding TfR2 polypeptides. As 
used herein, the phrase "binding specifically" encompasses the ability of a nucleic acid 
sequence to form double-helical segments therewith via the formation of hydrogen bonds 

1 5 between the complementary base pairs. An example of an antisense oligonucleotide is an 
antisense oligonucleotide comprising chemical analogs of nucleotides. 

In accordance with the present invention, it is provided compositions comprising an 
amount of the antisense oligonucleotide, described above, effective to reduce expression of 
TfR2 polypeptides by passing through a cell membrane and binding specifically with mRNA 

20 encoding TfR2 polypeptides so as to prevent translation and an acceptable hydrophobic 

carrier capable of passing through a cell membrane. Antisense oligonucleotide compositions 
are useful to inhibit translation of mRNA encoding TfR2 polypeptides. In accordance with 
one embodiment of the present invention, kits comprising the antisense of the present 
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invention are also provided for inhibiting the expression of TfR2 polypeptides. In 
accordance with another embodiment of the present invention, the compositions may be used 
to modulate levels of expression of TfR2 polypeptides. 

The present invention also provides compositions containing an acceptable carrier 
5 and any isolated, purified TfR2 polypeptide, an active fragment thereof, or a purified, mature 
protein and active fragments thereof, alone or in combination with each other. These 
polypeptides or proteins can be recombinantly derived, chemically synthesized or purified 
from native sources. As used herein, the term "acceptable carrier" encompasses any of the 
standard pharmaceutical carriers, such as phosphate buffered saline solution, water and 
1 0 emulsions such as an oil/water or water/oil emulsion, and various types of wetting agents. 

EXAMPLES 
EXPERIMENTAL PROCEDURES 
Cell Lines. HL-60, KG-1, U937 (myeloid leukemia); TF-1, K562 (erythroid 
leukemia); Jurkat, Molt-4 (T cell leukemia); Raji (Burkitt's lymphoma); LNCaP, PC-3 
1 5 (prostate cancer); MCF-7, MDA-MB-23 1 (breast cancer); IMR-32 (neuroblastoma); SK- 
Hepl (hepatoma); HepG2 (hepatoblastoma); U-20S (osteosarcoma) and SW480 (colon 
cancer) cell lines were obtained from American Type Culture Collection (ATCC, Manassas, 
VA). ML-1, NB4 and Kasumi 3 (myeloid leukemia), and both CHO-TRVb (TfR deficient 
Chinese hamster ovary) and TRVb-1 (human TfR stably transfected TRVb) cells were 
20 kindly provided by Drs. Minowada (14), Lanotte (15), Asou (16) and McGraw (17), 

respectively. Human mononuclear cells were isolated from the blood of a normal volunteer 
by centrifiigation on a FicoU-Paque (Pharmacia, Piscataway, NJ) gradient at 400 x g for 30 
min. Informed consent was obtained from the individual. 
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Molecular Cloning of cDNA and genomic DNA. Complementary DNA libraries 
were constructed from TF-1 and HL60 cells using a commercial kit (Marathon cDNA 
Amplification Kit, Clontech, Palo Alto, CA) and were used for 5 '-and 3 '-RACE reactions to 
obtain a full-length cDNA clone. Primers A and B (see Table 1) were used for 5*- and 3'- 
5 RACE, respectively. The products of RACE reactions were subcloned into the pGEM- 

Teasy vector (Promega, Madison, WI), We isolated two transcripts of 2.9 (a) and 2.5 (p) kb 
from the TF-1 and HL60 cDNA libraries, respectively. 

TABLE! 
Primer Sequences for TJR2 
1 0 These primers were used to amplify the TfR2 cDNAs in the RACE and RT-PCR 

analyses. Locations of these primers are shown as the nucleotide numbers in the TfR2-a- 
transcript sequence (GenBank accession number AF067864). Also, the locations of primers 
A, C, D and E are shown in FIG. 2. 



Primer 


Sequence 


Direction 


Location 


Name 








A 


5'-CCACACGTGGTCCAGCTTCTGGCGGGAG-3' 


Reverse 


603-576 


B 


5'-CAGTTGCATCATCAGGCCTTCC-3' 


Forward 


1,061-1,082 


C 


5'-ACGTCTCTGGCATCCTTCC-3' 


Forward 


TfR2-B only 


D 


5'-GTGGTCAGTGAGGATGTCAA-3' 


Forward 


376-395 


E 


5'-TGTAGGGGCAGTAGACGTCA-3' 


Reverse 


733-714 



Genomic DNA was isolated from a human genomic library (Lambda FIX II Library, 
1 5 Stratagene, La Jolla, CA) using a 2.2 kbp fragment of the 3'-end of the TfR2 cDNA as a 

probe (shown as probe- 1 in FIG. 1). After restriction enzyme mapping, a 3.85 kb fragment 
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which included exons 4 - 6 was subcloned into the pBluescript II(+) plasmid (Stratagene) 
(FIG. 1). Complementary and genomic DNA sequences were determined using an ABI 
Prism 373 automated sequencer (Perkin-Elmer, Foster City, CA). 

Chromosomal Mapping. The GeneBridge 4 Radiation Hybrid Panel, RH02 
(Research Genetics, Huntsville, AL) was used to determine the chromosomal location of the 
TfR2 gene as previously described (18). The primers A and C amplified a 178 bp fragment 
located in exon 4 (FIG. 2). The PGR products were electrophoresed through a 1 .5 % 
agarose gel, Southern blotted and hybridized with a ^^P-labeled probe of TfR2 (1 kbp 
fragment of the 5'-portion of the p form cDNA; shown as probe-2 in FIG. 1) to identify the 
hybrid clones containing the gene. The results were analyzed by accessing the database at 
the web site http://www-genome.wi.mit.edu/cgi-bin/contig/rhmapper.pl. 

Northern Blot and RT-PCR Analyses. Northern blot and RT-PCR analyses were 
performed as previously described (18) with some modification. Human tissue Northern 
blot membranes and cDNAs were purchased from OriGene (Rockville, MD). For Northern 
blot analysis, two TfR2 cDNA fragments (probe- 1 and -2 as shown in FIG. 1), a human p- 
actin cDNA fragment (OriGene) and an approximately 300 bp TfR cDNA fragment were 
used as probes. For RT-PCR, the a form-specific primers (primers-A and -D) and the p 
form-specific primers (primers-C and -E) were used (Table 1 and FIG. 2). Conditions for 
amplification were 35 cycles of 94°C for 30 s, 56°C for 40 s and 72°C for 1 mm. As a 
control, glyceraldehyde-3-phosphate dehydrogenase (G3PDH) was amplified in a separate 
reaction using primers, 5'-CCATGGAGAAGGCTGGGG-3' and 5'- 
CAAAGTTGTCATGGATGACC-3' for 27 cycles. The product was electrophoresed 
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through a 1.5 % agarose gel, transferred to nylon membranes, hybridized with radiolabeled 
TfR2 and G3PDH probes and autoradiographed. 

Transfection and Immunoblotting. CHO-TRVb cells were mamtained in F12- 
nutrient mixture (Gibco-BRL) supplemented with 5% fetal bovine serum. An amino 
terminal FLAG-tagged TfR2-a cDNA was subcloned into pcDNA3 (Invitrogen, Carlsbad, 
CA). This plasmid (100 ^ig) was transfected into CHO-TRVb cells using Lipofectin (Gibco- 
BRL). For transient expression, cells were harvested 48 h after the transfection. We also 
isolated a stably expressing clone using G418 (200 |ig/ml) selection and a standard limiting 
dilution method. The protein expression was confirmed by immunoblotting using anti- 
FLAG (M5) antibody (Eastman Kodak, New Haven, CT). Immunoblot analysis was 
performed as previously described (19). 

Flow Cytometric Analysis of Tf-binding to the Cell Surface. Approximately 3 x 
10^ cells were incubated with 5 ^ig/ml of biotinylated human holo-Tf (Sigma) in 500 ^1 
MEM a media (GIBCO) either in the presence or absence of nonlabeled human holo-Tf 
(Sigma) or human Lf (Calbiochem, San Diego, CA) for 30 min on ice. After two washes 
with PBS supplemented with 0.1 % bovine serum albumin, the cells were incubated with 
streptavidin-PE (DAKO). The cells were washed twice again and were subsequently 
analyzed by flow cytometry. 

Analysis of Tf-mediated Iron Uptake. One milligram of human apo-Tf (Sigma) in 
0.5 ml of 0.25 M Tris-HCI, 10 ^M NaHC03, pH 8.0 was mixed with 0.5 ml of 100 mM 
disodium nitrilotriacetate containing 0.4 mCi ^^FeCIs (NEM, Boston, MA). The mixture 
was incubated at room temperature for 1 h and radiolabeled Tf was separated by filtration on 
a PD-10 column (Pharmacia). A specific activity of 27,000 cpm/|ag was obtained. Ceils 
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were incubated with ^^Fe-Tf in MEMa media in the presence or absence of 200-fold excess 
of nonlabeled holo-Tf at 37°C with 5% CO2. After washing with PBS, the cells were lysed 
with 0.1 N NaOH and the radioactivity was counted using a liquid scintillation counter. 

RESULTS 

Molecular Cloning, Chromosomal Mapping and the Genomic Structure of the 
TfR2 Gene. We isolated seventeen 5"-RACE clones and ten 3'-RACE clones from the 7F-1 
cDNA library. Assembly of their nucleotide sequences indicate an approximately 2.9 kb 
cDNA sequence (a form; GenBank accession number AF067864). Using 5' and 3' gene- 
specific primers, a cDNA clone encompassing the putative fidl-length coding sequence was 
created by PGR from the TF-1 cDNA library. This indicated that the predicted cDNA 
sequence belonged to an actual expressed mRNA. When we used a HL60 cDNA library for 
cloning TfR2, the 5'-RACE products were shorter than those from the TF-1 library, and the 
sequences around the 5'-end were different (P form). All 5'-RACE products from the TF-1 
library belonged to the a form, and all 5'-RACE products from HL60 belonged to the p 
form. 

FIG. 1 shows the map of the TfR2 gene. According to FIG 1, an approximately 16 
kbp genomic fragment was cloned from a human genomic library (genomic clone 1) and 
restriction enzyme sites were mapped. A 3.85 kbp fragment of the genomic clone 1 (shown 
as a shaded bar) was subcloned into the pBluescript II(+) plasmid and sequenced. The exon- 
intron borders shown in this figure were based on data deposited in the GenBank (accession 
number AP053356) with some modifications based on our data. The a transcript contains 
1 8 exons (closed boxes on the line). The B transcript lacks exons 1 - 3, and has an additional 
142 bases at the 5'-end of exon 4 (an open box on the Ime). The lower two boxes are the 
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structures of the a and 6 transcripts. IC, TM and EC indicate the sequences encoding 
intracellular, transmembrane and extracellular domains, respectively. The locations of the 
probes that were used in the present invention are shown under the boxes. 

According to the radiation hybrid panel analysis, TfR2 mapped on chromosome 
7q22, between the D7S651 and WI-5853 markers. The restriction enzyme mapping and 
partial sequencing of a 1 6 kb genomic DNA clone and comparison with the deposited 
unpublished genomic sequence (20) revealed that the a form consisted of 18 exons (FIG. 1). 
However, some differences between their exon-intron borders and ours were noted. Our 
DNA sequence of the TfR2-a transcript contained an additional 81 nucleotides in exon 8 
(nucleotides 1,053 -1,133 in the TfR2-a transcript; GenBank accession number AF067864) 
and lacked 18 nucleotides in exon 18 (between nucleotides 2,163 and 2,164 ) as compared 
with their predicted mRNA sequence (20). This resulted in a twenty-seven amino acid 
addition and a six amino acid deletion for our predicted TfR2-a protein. Also, our mRNA 
sequence contained an additional 298 nucleotides in the 3'-untranslated region (UTR) 
(nucleotide 2580 to 2877). 

The p form, which may be an alternative product of splicing or promoter usage, 
lacked exons 1 , 2 and 3, and its first exon (exon 4 of the a form) had an additional 142 
nucleotide bases at the 5'-end (FIGS. 1 and 2). FIG. 2 shows the DNA sequences of exons 3 
- 5. Boxed sequences were found only in the p transcript. Arrows with solid and broken 
Hues indicate the primer sequences used to synthesize the a and 6 transcripts, respectively, 
by RT-PCR. Putative translation initiation codon for the B transcript is shown as bold 
"ATG". Guanines at -3 and +4, which are consistent with Kozak's sequence for this 
initiation codon, are underlined. 
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The Primary Structure of TfR2 Proteins and mRNAs. The predicted amino acid 
sequence of TfR2-a is shown in FIG. 3. FIG. 3 shows the deduced amino acid sequence of 
TfEl2-a-aligned with those for the human TfR and PSMA proteins. Identical residues are 
boxed. Hydrophobic amino acid stretches located in the putative transmembrane portions 
are shaded. The internalization motif of TfR. and the correspondingly similar motif of TfR2- 
a are double underlined. Predicted initial methionine of Tfll2-P is shown as a bold letter. 

The hydrophobic stretch of residues from 81 to 104 following a pair of arginines 
represents the predicted transmembrane domain. It is located close to the amino terminus, 
similar to the transmembrane domains of TfR and PSMA (shaded section in FIG. 3)(10, 21). 
By analogy to TfR and PSMA, TfR2-a probably is a type II membrane protein. Therefore, 
residues 1 to 80 of TfR2-a may be the cytoplasmic domain and residues 105 to 801 the 
extracellular domain. In the extracellular domain, amino acid sequence homologies between 
TfR2-a and either TfR or PSMA were quite high. The extracellular domain of TfR2-a was 
45 % identical and 66 % similar with that of TfR. With PSMA, the identity was 27 % and 
the similarity 60%. The cysteine residues at positions 89 and 98 of TfR form disulfide 
bonds resultmg in homodimerization. Two cysteine residues at positions 108 and 1 1 1 in 
TfR2-a are located in an analogous region and may serve a similar fimction. In addition, 
TfR2-a contains the motif YQRV (amino acid 23-26) in the middle of the cytoplasmic 
domain, that may fimction as an internalization signal, similar to the YTRF motif in TfR 
(FIG. 3, double underlined)(22-24). 

The p transcript lacks exons 1 to 3, which encode the entire transmembrane and 
cytoplasmic domains as well as part of the extracellular domain including the two cysteine 
residues at 108 and 1 1 1 . The additional 142 nucleotide 5'-sequence in exon 4 does not 
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contain an initiation codon. Translation probably starts at the ATG located at nucleotide 
542, which is in frame with the a transcript ORF. The predicted initial methionine is shown 
in FIG. 2, exon 4 and FIG. 3 as bold "ATG" and "M", respectively. This ATG contains a G 
at positions -3 and +4 indicating it is an ideal start site for translation (25). Hence the 
predicted protein product of the |3 transcript would lack both a transmembrane domain and 
signal peptide, resulting in a possible intracellular protein that may or may not be functional. 

Although the primary structure of the TfR2-a protein seemed to be quite similar to 
TfEl, the 3'-UTR of the TfR2 mRNA was shorter than that of the TfR transcript. Also, a 
typical iron-responsive element (IRE) was not present in the UTRs of either of the TfR2 
transcripts (26). 

Characterization of T£R2 mRNA Expression. FIGS. 4A and 4B show the results 
of the Northern blot analysis of poly A+ RNA from human tissues. Hybridization was with 
^^P-labeled TfR2 probes. FIG.4A shows multiple tissue blots of human mRNA were 
hybridized with a TfR2 probe (probe No. 1 in FIG. 1). Membranes were hybridized in the 
same bottle at the same time, and the autoradiograms were developed after a 12 hr exposure. 
In FIG. 4B, thirty micrograms of total RNA from cell lines were loaded hi each lane and 
hybridized with a TfR2 probe (probe No. 2) and a TfR probe. A ^^P-labeled p-actin probe 
was used as a control for all blots. Molecular weight markers or the positions of ribosomal 
RNA are indicated on the left. 

Northern blot analysis of poly A+ RNA from human tissues showed that a 2.9 kb 
mRNf A for TfR2 was expressed predominantly in the liver and, to a lesser degree, in the 
stomach (FIG. 4A). This corresponded with the length of TfR2-a cDNA isolated from TF-1 
cells. In addition, faint bands at 4 kb (stomach) and 1.7 kb (liver, lung, small intestine, 
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stomach, testis and placenta) were observed. These bands may reflect the presence of 
additional alternative forms of TfR2 mRNA. Northern blot analysis of total RNA of various 
cell lines revealed a high expression of TfR2-a in K562 (erythroleukemia) and HepG2 
(hepatoblastoma) (FIG. 4B). The expression levels of TfR2-a were not always correlated 
5 with those of TfR (FIG. 4B). No transcripts corresponding to TfR2-P (2.5 kb) were 
observed by Northern blot analysis. 

To compare the expression of the a and (3 transcripts, RT-PCR was performed using 
specific primers for each form. FIGS. 5 A and 5B show the representative results of RT- 
PCR analyses. RT-PCRs were performed with primers for a and B transcripts of TfR2 (35 

1 0 cycles) as well as G3PDH (27 cycles). The products were electrophoresed through L5 % 
agarose gels, transferred to nylon membranes, hybridized with radiolabeled probes and 
autoradiographed. FIG. 5A shows cDNA panels of human tissues. (MNC; human peripheral 
blood mononuclear cells.) FIG. 5B shows cDNAs from various human cell lines. 
Experiments were repeated at least twice for each sample, and the figures are representative 

15 resuks. The cDNAs for ML-1, Kasumi-3, HL60 and MDA-MB-231 are negative, but 
showed trace levels of a form expression in other experiments, 

FIGS. 5A and 5B show that using a human tissue cDNA panel as a template, the 
expression of the a form was limited to the liver, spleen, lung, muscle, prostate and 
peripheral blood mononuclear cells (FIG, 5 A). On the other hand, expression of the p form 

20 occurred in all of the human tissues tested. Human cancer cell Hues from various tissues 
were studied for expression of the two transcripts. Most of the cell lines expressed both 
transcripts except three; SK-Hepl (hepatoma) lacked both a and p transcripts, HepG-2 
(hepatoblastoma) and ML-1 (myeloblast) lacked the p transcript (FIG. 5B). Neither deletion 
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nor rearrangement of the TfR2 gene was detected in Southern blot analysis in SK-Hepl 
(data not shown). 

Tf-binding to the TfR2-a Transfected Cells. To analyze the function of TfR2-a, 
we stably transfected CHO-TRVb cells, which lack functional TfR, with FLAG-tagged 
5 T£R2-a. FIGS. 6 A, 6B and 6C show the expression and functional analysis of TfR2-a 
protein. In FIG. 6A5 Tf-binding to the cell surface was examined in neomycin resistant 
control CHO-TRVb cells (left panels), FLAG-tagged TfR2-a stably transfected cells 
(middle) and TRVb-l,TfR stably transfected cells (right). The cells were incubated with 5 
p,g/ml of biotinylated human holo-Tf in MEM a media for 30 min on ice. After washing 

1 0 with PBS, the cells were incubated with streptavidin-PE, and analyzed by flow cytometry. 
The solid lines show the histograms without competition. Competition experiments were 

performed in the presence of either 10-fold ( ) or 100-fold ( ) excess of 

either noniabeled Tf (upper panels) or Lf (lower panels). In FIG. 6B, Tf-mediated ^^Fe 
uptake was examined in neomycin resistant control CHO-TRVb cells (Neo cells), human 

1 5 TfR stably transfected cells (TfR cells) and FLAG-tagged TfR2-a stably transfected cells 
(TfR2 cells). Closed symbols (-C) represent cold competition experiments with 200-fold 
excess of noniabeled Tf. The mean + S. D. from either quadruplicate (without competition) 
or tripUcate (cold competition) experiments is shown. In FIG. 6C, cell lysates from 
pcDNAS transiently transfected cells (lane 1) and FLAG-tagged TfR2 transfected cells 

20 (lanes 2 and 3) were electrophoresed through a 4-15% linear gradient SDS-polyacrylamide 
gel. For the sample in lane 3, 2-mercaptoethanol was omitted from the sample buffer. After 
transferring to a PVDF membrane, FLAG-fusion proteins were detected by immunoblotting. 
The positions of molecular weight markers are indicated on the left. 
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In FIGS. 6A, 6B and 6C, the cell surface Tf-binding was examined using 
biotinylated Tf and flow cytometry. Neomycin resistant control cells were almost negative 
for the cell surface Tf-binding (FIG. 6 A, left). TRVb-1, the human TfR stably transfected 
cells were positive for cell surface binding of Tf, and this binding was competed by 

5 nonlabeled Tf but not by Lf (FIG. 6A, middle). For the CHO-TRVb cells stably expressing 
TfR2-a, the mean level of cell surface Tf-binding was clearly higher than that of the control 
cells (FIG. 6 A, right, solid lines). In competition experiments, 10-fold excess of nonlabeled 
Tf markedly inhibited the binding of biotinylated Tf, but even 100-fold excess of Lf did not 
inhibit the binding (FIG. 6A, right, broken lines). Tf-binding to the TfR2-a cells was also 

1 0 examined in a transient expression system using CHO-TRVb cells, and the levels of Tf- 
binding to the cell surface were consistently as follows: TfR cells > TfR2-a cells > pcDNA3 
cells (data not shown). 

Tf-mediated ^^Fe Uptake of theTfR2-a-Transfected Cells. Human TfR and TfR2- 
a stably transfected CHO-TRVb cells were incubated with ^^Fe-Tf, and ^^Fe uptake was 

1 5 measvired. Neomycin resistant CHO-TRVb cells were used as controls. Tf-mediated ^^Fe 
uptake by the TfiR2-a cells was comparable to TfR cells; both were clearly higher than 
control cells (Fig. 6B). Competition by 200-fold excess of nonlabeled Tf ahnost completely 
blocked ^^Fe incorporation in these three cell lines after a 5 h incubation (Fig. 6B). In spite 
of the absence of functional TflR., a slight uptake of Tf-mediated ^^Fe was also observed in 

20 the control TRVb cells as previously reported by Chan, et al. (27). 

Dimerization of the FLAG-tagged T£R2-a Proteins Expressed in Mammalian 
Cells. Cell lysates from the cells transiently transfected with pcDNA3 empty vector or the 
FLAG-tagged TfR2-a plasmid were examined by immimoblotting using anti-FLAG 
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antibody (FIG. 6C). Two closely migrated bands of --105 kDa were observed in the cell 
lysate transfected with FLAG-tagged TfR2-a under reducing conditions (lane 2). When 2- 
mercaptoethanol was omitted from the sample loading buffer, the doublet of -^105 kDa 
decreased, but a protein of -215 kDa appeared (lane 3). Faint bands of --260 kDa and -^125 
5 kDa were also seen under non-reducing conditions (lane 3, arrows). 

DISCUSSION 

The primary structure of the TfR2-a protein deduced from its mRNA is similar to 
that of TfR (see RESULTS). In addition, TfR2-a transfected cells showed increases of both 
Tf-binding and Tf-mediated iron uptake (FIG. 6A and B). However, the mechanisms that 

1 0 regulate expression of TfR2 and TfR may be different. Levels of the TfR protein are 
regulated post-transcriptionally through IREs in its 3'-UTR, to which iron regulatory 
protein-I (IRP-1) and IRP-2 can bind. In cells lacking sufficient iron, IRPs bind to the iron- 
responsive elements of TfR mRNA and stabilize these transcripts. In the presence of excess 
intracellular iron, IRPs are released, leading to degradation of the TfR mRNA. In rapidly 

1 5 growing cells, proto-oncogene c-MYC represses H-ferritin and upregulated IRP-2, and the 
upregulation of IRP-2 may increase TfR protein expression (28). Neither the 3'- nor the 5^- 
UTRs of the TfR2 mRNAs have a detectable IRE-like structure, suggesting another 
mechanism(s) may regulate TfR2 expression. 

Northem blot analysis using normal human poly A"* RNA from a variety of tissues 

20 showed that the liver was the only cell type that prominently expressed TfR2-a (FIG, 4A). 
Also, TfR2-a was expressed highly in the K562 erythroleukemic cell line which is capable 
of hemoglobin synthesis (FIG. 4B). This result suggests that erythroid hematopoietic cells 
may also express high levels of TfR2-a. The major product of red blood cells is 
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hemoglobin which contains abimdant iron, and if TfR2-a is involved in iron transport, it 
would be expected to be strongly expressed on these cells. In erythroid cells, Cotner et al. 
predicted the presence of an alternative form of TfR using a set of monoclonal antibodies 
against TfR (29). Their findings may be ascribed to TfR2-a. 
5 The size of the FL AG-tagged TfR2-a expressed in mammalian cells is --105 kDa in 

the presence of a reducing agent, and is -^215 kDa in the absence of a reducing agent (FIG. 
6C), indicating dimerization of TfEl2-a through disulfide bonds. The size of FLAG-tagged 
TfR2-a monomer, -^105 kDa, is larger than the molecular weight calculated fi*om the amino 
acid sequence (-90 kDa). This may reflect post-translational modifications of the protein 

1 0 such as glycosylation. Actually there are 4 putative N-glycosylation sites (amino acids 240- 
243, 339-342, 540-543 and 754-757) in the TfR2-a protein. Hence, the double bands of 
-105 kDa seen in FIG. 6C may be due to different states of glycosylation. In addition, faint 
bands of -^260 kDa and -125 kDa just above the clear bands of -215 kDa and -105 kDa, 
respectively, were observed under non-reducing conditions (FIG. 6C, lane 3, arrows). These 

1 5 faint bands may reflect interaction of TflEl2-a with a small protein (-20 kDa) through 
disulfide bonds, which may or may not be a ligand. 

To investigate the fimction of TfR2, Tf and other Tf family members were 
considered as candidate ligands of TfR2. Six members of Tf family have been cloned to 
date; Tf, Lf, melanotransferrin (30), ovotransferrin, saxiphilin (31), and porcine inhibitor of 

20 carbonic anhydrase (32). The last two do not possess iron-binding properties and the last 
three have not been identified in humans. Melanotransferrin is an unlikely TfR2 ligand 
because it is a membrane-bound protein of melanoma cells. Only Tf and Lf remained as 
candidates. The CHO-TRVb cells transfected with FLAG-tagged Tfil2-a showed higher 
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levels of Tf-binding to the cell surface than did the control cells (FIG. 6A). This indicates 
that FL AG-tagged TfR2-a was expressed on the cell surface and was bound by Tf. This 
binding was effectively competed by nonlabeled Tf but not by Lf (FIG. 6 A). This indicates 
that Tf can bind to TfR2-a more specifically than can Lf In addition, Tf-mediated iron 
5 uptake by TflR2-a transfected cells was obviously higher than that of control cells (FIG. 6B). 
However, if the only ligand for TfR2-a is Tf and the main function of TfR2-a is 
cellular iron uptake, why do the cells have two different receptors for Tf? TfR2-a may 
simply be another transferrin receptor with a different affinity. Possibly, the fate of the Tf / 
TfR2-a complex on the cell surface may be different from that of the Tf / TflR complex. 

1 0 The putative internalization motif of TfR2-a is not identical to that of TfR, and even a minor 
difference of the internalization motif may result in different destinations of the endosomes 
(24). Still, the possibility that TfR2-a has another specific ligand other than Tf remains. 
Recently, the field of iron metabolism has been markedly advanced by the discoveries of 
HFE, mutations of which occur in most of the patients with hereditary hemochromatosis (8, 

1 5 9), and Nramp2, an intestinal iron transporter (33, 34). Does TfR2-a-a bind to HFE, which 
normally forms a complex with TfR on the cell membrane? If it does, TfR2-a may affect 
the cellular iron uptake by chelating HFE. Can TfR2-a form a heterodimer with TfR? This 
may also affect cellular iron uptake. Elucidation of the precise role of TfR2 may provide an 
important step for clarifying the mechanisms and the regulation of cellular iron uptake. 

20 We cloned two different forms of transcripts from TfR2 gene, a and p. Two 

different transcripts are also expressed from the PSMA gene, another member of the TfR- 
like family. The shorter form of PSMA lacks the 5*-end encoding the transmembrane 
domain (35), similar to the p-form of TfR2. Nearly a 100-fold difference in the ratio of 
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expression of the longer and the shorter forms of PSMA mRNA has been reported during 
progression of prostate cancer, with the shorter form predominant in normal cells and the 
longer form predominant in the cancer cells (36). Using the extremely sensitive RT-PCR 
method, we could distinguish expression of the (a and p forms of the TfR2 gene. Among 
5 normal tissues, the expression of the a (longer) form was detected in the liver, spleen, lung, 
muscle, prostate and peripheral blood mononuclear cells (FIG. 5A). The P form was 
distributed more widely. Interestingly, the two cell lines derived from the liver (SK-Hepl 
and HepG2) lacked expression of the p form, whereas most cell lines from other tissues as 
well as normal liver expressed this shorter form (FIGS. 5A and 5B). 
1 0 We mapped TfR2 to chromosome 7q22. Deletion or loss of heterozygosity of this 

chromosomal region has been reported in several malignant diseases including 
myelodysplastic syndromes, acute myeloid leukemia, as well as breast, ovarian and 
pancreatic cancers (37-41). It is speculated that TfR2 mutations may occur in these cancers. 
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WHAT IS CLAIMED IS : 

1 . An isolated nucleic acid encoding a TfR2 polypeptide. 

2. The isolated nucleic acid of claim 1 , wherein the nucleic acid comprises 

DNA. 

3. The isolated nucleic acid of claim 2, wherein the DNA is a cDNA. 

4. The isolated nucleic acid of claim 1, wherein the DNA encodes a polypeptide 
having the amino acid sequence set forth in SEQ ID NO: 1 . 

5. The isolated nucleic acid of claim % wherein the DNA has substantially the 
same nucleotide sequence as the sequence set forth in SEQ ID NO:2 or SEQ ID NO:3. 

6. A recombinant expression vector comprising DNA of claim 2. 

7. A host cell containing a vector of claim 6, wherein the cell is a procaryotic 
cell or a eucaryotic cell. 

8. A host cell of claim 7, wherein the cell expresses a functional TfR2 protein. 

9. Isolated mRNA complementary to DNA of claim 2. 

10. An oligonucleotide composition comprising chemical analogues of the 
nucleic acid of claim 2 operatively linked to a promoter of RNA transcription. 

11. An antisense oligonucleotide capable of specifically binding to and inhibiting 

the translation of mRNA of claim 9. 

12. An isolated TfR2 polypeptide, or fragments thereof, and functional 
equivalents thereof. 

13. The isolated TfR2 polypeptide of claim 12 having at least substantially the 
same amino acid sequence as that set forth in SEQ ID NO : 1 . 
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14. The isolated TfR2 polypeptide of claim 12 being encoded by at least the 
nucleotide set forth in SEQ ID N0:2 or SEQ ID N0:3. 

15. A TfR2 polypeptide expressed recombinantly in a host cell. 

16. An antibody that specifically binds to a determinant on a TfR2 polypeptide of 
claim 12 or active fragment thereof. 

17. The antibody of claim 16, wherein the antibody is a monoclonal antibody. 

1 8. The antibody of claim 16, wherein the antibody is a polyclonal antibody. 

19. A composition comprising an amount of the antisense oligonucleotide of 
claim 1 1 effective to modulate expression of a TfR2 polypeptide and an acceptable 
hydrophobic carrier capable of passing through a cell membrane. 

20. A composition comprising an amount of an antibody of claim 1 6 effective to 
block the function of the TfR2 protein or to block interaction of the TfR2 protein with other 
proteins or ligands. 

21 . A method for detecting the presence of TfR2 protein on a cell surface 
comprising the steps of: 

(a) providing an antibody specific for TfR2 protein, 

(b) contacting the cell with the antibody under conditions that allow the binding 
of the antibody to the TfR2 protein of the cell, and 

(c) detecting the antibody bound to the cell. 

22. The method of claim 2 1 , wherein the antibody is labled with a detectable 
marker. 
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23 . The method of claim 22, wherein the detectable marker is selected from a 
group consisting of radionucleotides, enzymers, fluorogens, chromogens, and 
chemiluminescent labels. 
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ABSTRACT 

The present invention provides isolated nucleic acids encoding TfR2 polypeptides, or 
fragments thereof, and isolated TfR2 polypeptides encoded thereby. Further provided are 
vectors containing the nucleic acids of the present invention, host cells transformed therewith, 
antisense oligonucleotides thereto and compositions containing antibodies that specifically bind 
to invention polypeptides. Methods of detecting TfR2 protein in a cell are also provided. 
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FIG. 2 

Exon 3 of Tf R2-a 

CCTTCCTACTGGGCTACGTCGCCTTCCGAGGGTCCTGCCAGGCGTGCGGAGACTCTGTGT 
TG GTGGTCAGTGAGGATGTC^ CTATGAGCCTGACCTGGATTTCCACCAGGGCAGACTCT 

^ Rrimer-D 

ACTGGAGCGACCTCCAGGCCATGTTCCTGCAGTTCCTGGGGGAGGGGCGCCTGGAGGACA 
CCATCAG 



Exon 4 (boxed sequence Is in the TfR2-p only) 



GCGTCCGCGGGGAGCGCTCTTTTCCTAAACTCAGGAACCCCTCGCCGCCCCTGCCCCTGG 
CGACCCCACGTCTCTGGCATCCTTCgCTCTTCCCTCCCTCTCCTCCGGGCGCCCAAAAAA 

" " Primer^C 



GTCCCCACCTCTCCCCGCTTAGGCAAACCAGCCTTCGGGAACGGGTGGCAGGCTCGGCCG 



rtaATCnr.rGCITCTGACTCAGGACATTCGCGCGGCGCT qTCCCGCCAGAAGCTGGACCACG 

Primer-A 

ISXSfiACCGACACGCACTACGTGGGGCTGCAATTCCCGGATCC 



Exon 5 (common for both a- and p-forms) 

GGCTCACCCCAAC^CCCTGCACTGGGTO^ 

Primer-E 

GCTGGAGGACCCTGACGTCTACTGCCCCTACAGCGCCATCGGCAACGTCACG 
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Fig. 7 alpha amino acid sequence 



MERLWGLFQRAQQLS?RSSQTVYQRVEGPRKGHL£EEE£DGE£GAETLAHFC?MELRGPEPLGSRPRQPNLI?WAAAGRR 
AAPYLVLTALLIFTG2.FLLGYVAFRG3CQACGDSVLVVSEDVNYEPDLDFHQGRLYWSDLQAiMFLQFLGEGRLEDTIRQT 
SLRERVAGSAGMAALTQDIRAALSRQKLDHVWTOTHYVGLQFPDPAHPNTLHWVDEAGKVGEQLPLEDPDVYCPYSAIGN 
VTGELVYAHYGRPEDLQDLRARGVDPVGRLLLVRVGVI3FAQEC/TNAQDFGAQGVLIYPEPADFSQDPPKPSLSSQQAVY 
GHVHLGTGDPYTPGFPSFNQTQFPPVASSGLPSIPAQPISADIASRLLRKLKGPVAPQEWQGSLLGSPYHLGPGPRLRLV 
VNNHRTSTrlNNIFGCIEGRSEPDHYVVIGAQRDAWGPGAAKSAVGTAILLELVRTFSSMVSNGFRPRRSLLFrSWDGGD 
FGSVGSTEWLEGYLSVLHLKAVVYVSLDNAVLGDDKFHAKTSPLLTSLIESVLKQVDSPNKSGQTLYEQVVFTNPSWDAE 
VIRPLPMDSSAYSFTAFVGVPAVEFSFMEDDQAYPFLHTKEDTYENLHKVLQGRLPAVAQAVAQLAGQLLIRLSHDRLL? 
LDFGRYG0VVLRHIGNLNEFSGDLKARGLTLQWVYSARGDYIRAAEKLRQEIYSSEERDERLTRMYNVRIMRVEFYFL3Q 
YVSPADSPFRHIFMGRGDHTLGALLDHLRLLRSNSSGTPGATSSTGFQESRFRRQI^LLTWTLQGAANALSGDV'WNXDNN 



Fig. 3 alpha DNA sequence 

CTGCAGGCTTCAGGAGGGGACACAAGCATGGAGCGGCTTTGGGGTCTATTCCAGAGAGcGCAACAACTGTCCCCAAGATC 
CTCTCAGi^CCGTCTACCAGCGTGTGGAAGGCCCCCGGAAAGGGCACCTGGAGGAGGAAGAGGAAGACGGGGAGGAGGGGG 
C^-GAGACA^'^GGCCCACTTCTGCCCCATGGAGCTGAGGGGCCCTGAGCCCCTGGGCTCTAGACCCAGGCAGCCAAACCTC 
AT^^CCTGGGCGGCAGCAGGACGGAGGGCTGCCCCCTACCTGGTCCTGACGGCCCTGCTGATCTTCACTGGGGCCTTCCT 
ACTGGGCTACGTCGCCTTCCGAGGGTCCTGCCAGGCGTGCGGAGACTCTGTGTTGGTGGTCAGTGAGGATGTCAACTATG 
AGCCTGACCTGGATTTCCACCAGGGcagactctactggaGcgacCtccaGgccatgttcctgcagttcctgggggaGggg 
cgc-tggaGgaCaccarCAGGCAAACCAGCCTTCGGGAACGGGTGGCAGGCTCGGCCGGGATGGCCGCTCTGACTCAGGA 
CA^TCGCGCGGCGCTCTCCCGCCAGAAGCTGGACCACGTGTGGACCGACACGCACTACGTGGGGCTGCAATTCCCGGATC 
CGGCTCACCCCAACACCCTGCACTGGGTCGATGAGGCCGGGAAGGTCGGAGAGCAGCTGCCGCTGGAGGACCCTGACGTC 
TAC^GCCCCTACAGcGCCATCGGCAACGTCACGGGAGAGCTGGTGTAcGCCCACTACGGGCGGCCCGAAGACCTGCAGGA 
CcTGCGGGCCAGGGGCGTGGATCCAGTGGGCCGCCTGCTGCTGGTGCGCGTGGGGGTGATCagcTTCGCCCAGAAGGTGA 
C^AATGCTCAGGACTTCGGGGCTCAAGGAGTGCTCATATACCCAGAGCCAGCGGACTTCTCCCAGGACCCACCCAAGCCA 
AGCCTGTCCAGCCAGCAGGCAGTGTATGGACATGTGCACCTGGGAACTGGAgACCCcTACACACCTGGCTTCCCTTCCTT 
CAATCAAACCCAGTTCCCTCCAGTTGCATCATCAGGCCTTCCCAGCATCCcAGCCCAGCCCATCAGTGCAGACATTGCCT 
CC^GCCTGCTGAGGAAGCTCAAAGGCCCTGTGGCCCCCCAAGAATGGCAGGGGAGCCTCCTAGGCTCCCCTTATCACCTG 
GGCCCCGGGCCACGACTGCGGCTAGTGGTCAACAATCACAGGACCTCCACCCCCATCAACAACATCTTCGGCTGCATCGA 
AGGCCGCTCAGAGCCAGATCACTACGTTGTCATCGGGGCCCAGAGGGATGCATGGGGCCCAGGAGCAGCTAAATCCGCTG 
TGGGGACGGCTATACTCCTGGAGCTGGTGCGGACCTTTTCCTCCATGGTGAGCAACGGCTTCCGGCCCCGCAGAAGTCTr 




CCl!C?iPLPjGCCQThGiQTh'^^ 

'^GACAAGTC'^CATTGAGAGTGTCCTGAAGCAGGTGGATTCTCCCAACCACAGTGGGCAGACTCTCTATGAACAGGTGGTG 
T'^CACCAATCCCAGCTGGGATGCTGAGGTGATCCGGCCCCTACCCATGGACAGCAGTGCCTATTCCTTCACGGCCTTTGT 
'-GGAG'CCC':'GCCGT''GAGTTCTCCTTTATGGAGGACGACCAGGCCTACCCATTCCTGCACACAAAGGAGGACACTTATG 

ag;^acctgcataaggtgctgcaaggccgcctgcccgccgtggcccaggccgtggcccagctcgcagggcagctcctcatc 

C'-GCT'-AGCCACGATCGCCTGCTGCCCCTCGACTTCGGCCGCTACGGGGACGTCGTCCTCAGGCACATCGGGAACCTCAA 

cgagttctctggggacctcaaggcccgcgggctgaccctgcagtgggtgtactcggcgcggggggactacatccgggcgg 




CGGGTGGAGTTCTACTTCCTTTCCCAGTACGTGTCGCUAUC-UIjAU luuucij i V, 1 1 -av^.--v-w<^ ^ -.-.^^GA 

ccacacgc-^gggcgccctgctggaccacctgcggctgctgcgctccaacagctccgggacccccggggccacctcctcca 
c'^ggc'^tccaggagagccgtttccggcgtcagctagccctgctcacctggacgctgcaaggggcagccaatgcgctta^-c 
ggggatgtctggaacattgataacaacttctgaggccctggggatcctcacatccccgtcccccagtcaagagctcctct 
gctcctcgc-tgaatgattcagggtcagggaggtggctcagagtccacctctcattgctgatcaatttctcattacccct 
acacatctctccacggagcccagaccccagcacagatatccacacaccccagccctgcagtgtagctgaccctaatgtga 

CGG-CA'ACTGTCGGTTAATCAGAGAGTAGCATCCCTTCAATCACAGCCCCTTCCCCTTTCTGGGGTCCTCCATACCTAG 

agaccac-ctgggaggtttgctaggccctgggacctggccagctctgttagtgggagagatcgctggcaccatagcctta 

■-^---C-AACAGGTGGTc-GTGGTGAAAGGGGCGTGGAGTTTCAATATCAATAAACCACCTGATATCAATAAGCCAAAA 



\ 



;ig. 9 beca DNA sequence 



GCGTCCGCGGGGAGCGCTCTTTTCCTAA?ICTCAGGAACCCCTCGCCGCCCCTGCCCCTGGCGACCCCACGTCTCTGGCAT 
CCTTCCCTCTTCCCTCCCTCTCCTCCGGGCGCCCAAAAAAGTCCCCACCTCTCCCCGCTTAGGCAAACCAGCCTTCGGGA 
ACGGGTGGCAGGCTCGGCCGGGATGGCCGCTCTGACTGAGGACATTCGCGCGGCGCTCTCCCGCCAGAAGCTGGACCACG 
TGTGGACCGACACGCACTACGTGGGGCTGCAATTCCCGGATCCGGCTCACCCC^^CACCCTGCACTGGGTCGATGAGGCC 
GGGAAGGTCGGAGAGCAGCTGCCGCTGGAGGACCCTGACGTCTACTGCCCCTACAGcGCCATCGGCAACGTCACGGGAGA 
GCTGGTGTAcGCCGACTACGGGCGGCCCGAAGACGTGCAGGACcTGCGGGCCAGGGGCGTGGATCCAGTGGGCCGCCTGC 
TGCTGGTGCGCGTGGGGGTGATCagcTTCGCCCAGAAGGTGACCAATGCTCAGGACTTCGGGGCTCAAGGAGTGCTCATA 
TACCGAGAGCGAGCGGACTTCTCCCAGGACCCACCCAAGCCAAGCCTGTCCAGCGAGCAGGCAGTGTATGGACATGTGCA 
CCTGGGAACTGGAgACGCcTACACACCTGGCTTCCCTTCCTTGAATCAAACCCAGTTCCCTCCAGTTGCATCATCAGGCC 
TTCCCAGCATCCcAGCCCAGCCCATCAGTGCAGACATTGCCTCCCGCCTGCTGAGGAAGCTCAAAGGCCCTGTGGCCCCC 
CAAGAATGGCAGGGGAGCCTCCTAGGCTCCCCTTATCACCTGGGCCCCGGGCCACGACTGCGGCTAGTGGTCAACAATCA 
CAGGACCTCCACCCCCATCAACAACATCTTCGGCTGCATCGAAGGCCGCTCAGAGCCAGATCACTACGTTGTCATCGGGG 
CCCAGAGGGATGCATGGGGCCCAGGAGCAGCTAAATCCGCTGTGGGGACGGCTATACTCCTGGAGCTGGTGCGGACCTTT 
TCCTCCATGGTGAGCAACGGCTTCCGGCCCCGCAGAAGTCTCCTCTTCATCAGCTGGGACGGTGGTGACTTTGGAAGCGT 
GGGCTCCACGGAGTGGCTAGAAGGCTACCTCAGCGTGCTGCACCTCAAAGCCGTAGTGTACGTGAGCCTGGACAACGCAG 
TGCTGGGGGATGACAAGTTTCATGCCAAGACCAGCCCCCTTCTGACAAGTCTCATTGAGAGTGTCCTGAAGCAGGTGGAT 
TCTCCCAACCACAGTGGGCAGACTCTCTATGAACAGGTGGTGTTCACCAATCCCAGCTGGGATGCTGAGGTGATCGGGCC 
CCTACCCATGGACAGCAGTGCCTATTCCTTCACGGCCTTTGTGGGAGTCCCTGCCGTCGAGTTCTCCTTTATGGAGGACG 
ACCAGGCCTACCCATTCCTGCACACAAAGGAGGACACTTATGAGAACCTGCATAAGGTGCTGCAAGGCCGCCTGCCCGCC 
GTGGCCCAGGCCGTGGCCCAGCTCGCAGGGCAGCTCCTCATCCGGCTCAGCCACGATCGCCTGCTGCCCCTCGACTTCGG 
CCGCTACGGGGACGTGGTCCTCAGGCACATCGGGAACCTCAACGAGTTCTCTGGGGACCTCAAGGCCCGCGGGCTGACCG 
TGCAGTGGGTGTACTCGGCGCGGGGGGACTACATCCGGGCGGCGGAAAAGCTGCGGCAGGAGATCTACAGCTCGGAGGAG 
AGAGACGAGCGACTGACACGCATGTACAACGTGCGCATAATGCGGGTGGAGTTCTACTTCCTTTCCCAGTACGTGTCGCC 
AGCGGACTCCCCGTTGCGCCACATCTTCATGGGCCGTGGAGACCACACGCTGGGCGCCCTGCTGGACCACCTGCGGCTGC 
TGCGCTCCAACAGCTCCGGGACCCCCGGGGCCACCTCCTCCACTGGCTTCCAGGAGAGCCGTTTCCGGCGTCAGCTAGCC 
CTGCTCACCTGGACGCTGCAAGGGGCAGCCAATGCGCTTAGCGGGGATGTCTGGAACATTGATAACAACTTCTGAGGCCC 
TGGGGATCCTCACATCGGCGTCCCCCAGTCAAGAGCTCCTCTGCTCCTCGCTTGAATGATTCAGGGTCAGGGAGGTGGCT 
CAGAGTCCACCTCTCATTGCTGATCAATTTCTCAT7ACCCCTACACATCTCTCCACGGAGCCCAGACCCCAGCACAGATA 
TCCACACACCCCAGCCCTGCAGTGTAGCTGACCCTAATGTGACGGTCATACTGTCGGTTAATCAGAGAGTAGCATCCCT? 
CAATCACAGCCCCTTCCCCTTTCTGGGGTCCTCCATACCTAGAGACCACTcTGGGAGGTTTGCTAGGCCCTGGGACCTGG 
CCAGCTCTGTTAGTGGGAGAGATCGCTGGCACCATAGCCTTATGGCCAACAGGTGGTcTGTGGTGAAAGGGGCGTGGAGT 
TiCAATATCAATAAACCACCTGATATCAATAAGCCAAAA 
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<400> 1 

Met Glu Arg Leu Trp Gly Leu Phe Gin Arg Ala Gin Gin Leu Ser Pro 
15 10 15 

Arg Ser Ser Gin Thr Val Tyr Gin Arg Val Glu Gly Pro Arg Lys Gly 
20 25 30 

His Leu Glu Glu Glu Glu Glu Asp Gly Glu Glu Gly Ala Glu Thr Leu 
35 40 45 

Ala His Phe Cys Pro Met Glu Leu Arg Gly Pro Glu Pro Leu Gly Ser 
50 55 60 

Arg Pro Arg Gin Pro Asn Leu lie Pro Trp Ala Ala Ala Gly Arg Arg 
65 70 75 80 

Ala Ala Pro Tyr Leu Val Leu Thr Ala Leu Leu He Phe Thr Gly Ala 
85 90 95 

Phe Leu Leu Gly Tyr Val Ala Phe Arg Gly Ser Cys Gin Ala Cys Gly 
100 105 110 

Asp Ser Val Leu Val Val Ser Glu Asp Val Asn Tyr Glu Pro Asp Leu 
115 120 125 

Asp Phe His Gin Gly Arg Leu Tyr Trp Ser Asp Leu Gin Ala Met Phe 

1 



130 135 140 

Leu Gin Phe Leu Gly Glu Gly Arg Leu Glu Asp Thr lie Arg Gin Thr 
145 150 155 160 

Ser Leu Arg Glu Arg Val Ala Gly Ser Ala Gly Met Ala Ala Leu Thr 
165 170 175 

Gin Asp lie Arg Ala Ala Leu Ser Arg Gin Lys Leu Asp His Val Trp 
180 185 190 

Thr Asp Thr His Tyr Val Gly Leu Gin Phe Pro Asp Pro Ala His Pro 
195 200 205 

Asn Thr Leu His Trp Val Asp Glu Ala Gly Lys Val Gly Glu Gin Leu 
210 215 220 

Pro Leu Glu Asp Pro Asp Val Tyr Cys Pro Tyr Ser Ala He Gly Asn 
225 230 235 240 

Val Thr Gly Glu Leu Val Tyr Ala His Tyr Gly Arg Pro Glu Asp Leu 
245 250 255 

Gin Asp Leu Arg Ala Arg Gly Val Asp Pro Val Gly Arg Leu Leu Leu 
260 265 270 

Val Arg Val Gly Val He Ser Phe Ala Gin Lys Val Thr Asn Ala Gin 
275 280 285 

Asp Phe Gly Ala Gin Gly Val Leu He Tyr Pro Glu Pro Ala Asp Phe 
290 295 300 

Ser Gin Asp Pro Pro Lys Pro Ser Leu Ser Ser Gin Gin Ala Val Tyr 
305 310 315 320 

Gly His Val His Leu Gly Thr Gly Asp Pro Tyr Thr Pro Gly Phe Pro 
325 330 335 

Ser Phe Asn Gin Thr Gin Phe Pro Pro Val Ala Ser Ser Gly Leu Pro 
340 345 350 

Ser He Pro Ala Gin Pro He Ser Ala Asp He Ala Ser Arg Leu Leu 
355 360 365 

Arg Lys Leu Lys Gly Pro Val Ala Pro Gin Glu Trp Gin Gly Ser Leu 
370 375 380 

Leu Gly Ser Pro Tyr His Leu Gly Pro Gly Pro Arg Leu Arg Leu Val 
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385 



390 



395 



400 



Val Asn Asn His Arg Thr Ser Thr Pro He Asn Asn He Phe Gly Cys 
405 410 415 

He Glu Gly Arg Ser Glu Pro Asp His Tyr Val Val He Gly Ala Gin 
420 425 430 

Arg Asp Ala Trp Gly Pro Gly Ala Ala Lys Ser Ala Val Gly Thr Ala 
435 440 445 

He Leu Leu Glu Leu Val Arg Thr Phe Ser Ser Met Val Ser Asn Gly 
450 455 460 

Phe Arg Pro Arg Arg Ser Leu Leu Phe He Ser Trp Asp Gly Gly Asp 
465 470 475 480 

Phe Gly Ser Val Gly Ser Thr Glu Trp Leu Glu Gly Tyr Leu Ser Val 
485 490 495 

Leu His Leu Lys Ala Val Val Tyr Val Ser Leu Asp Asn Ala Val Leu 
500 505 510 

Gly Asp Asp Lys Phe His Ala Lys Thr Ser Pro Leu Leu Thr Ser Leu 
515 520 525 

He Glu Ser Val Leu Lys Gin Val Asp Ser Pro Asn His Ser Gly Gin 
530 535 540 

Thr Leu Tyr Glu Gin Val Val Phe Thr Asn Pro Ser Trp Asp Ala Glu 
545 550 555 560 

Val He Arg Pro Leu Pro Met Asp Ser Ser Ala Tyr Ser Phe Thr Ala 
565 570 575 

Phe Val Gly Val Pro Ala Val Glu Phe Ser Phe Met Glu Asp Asp Gin 
580 585 590 

Ala Tyr Pro Phe Leu His Thr Lys Glu Asp Thr Tyr Glu Asn Leu His 
595 600 605 

Lys Val Leu Gin Gly Arg Leu Pro Ala Val Ala Gin Ala Val Ala Gin 
610 615 620 

Leu Ala Gly Gin Leu Leu He Arg Leu Ser His Asp Arg Leu Leu Pro 
625 630 635 640 

Leu Asp Phe Gly Arg Tyr Gly Asp Val Val Leu Arg His He Gly Asn 
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OR 



645 650 655 

Leu Asn Glu Phe Ser Gly Asp Leu Lys Ala Arg Gly Leu Thr Leu Gin 

660 665 670 

Trp Val Tyr Ser Ala Arg Gly Asp Tyr lie Arg Ala Ala Glu Lys Leu 
675 680 685 

Arg Gin Glu lie Tyr Ser Ser Glu Glu Arg Asp Glu Arg Leu Thr Arg 
690 695 700 

Met Tyr Asn Val Arg He Met Arg Val Glu Phe Tyr Phe Leu Ser Gin 
705 710 715 720 

Tyr Val Ser Pro Ala Asp Ser Pro Phe Arg His He Phe Met Gly Arg 
725 730 735 

Gly Asp His Thr Leu Gly Ala Leu Leu Asp His Leu Arg Leu Leu Arg 
740 745 750 

Ser Asn Ser Ser Gly Thr Pro Gly Ala Thr Ser Ser Thr Gly Phe Gin 
755 760 765 

Glu Ser Arg Phe Arg Arg Gin Leu Ala Leu Leu Thr Trp Thr Leu Gin 
770 775 780 

Gly Ala Ala Asn Ala Leu Ser Gly Asp Val Trp Asn He Asp Asn Asn 
785 790 795 800 

Phe 



<210> 2 
<211> 2877 
<212> DNA 

<213> human genome 
<400> 2 

ctgcaggctt caggagggga cacaagcatg 
caacaactgt ccccaagatc ctctcagacc 
gggcacctgg aggaggaaga ggaagacggg 
tgccccatgg agctgagggg ccctgagccc 
attccctggg cggcagcagg acggagggct 
atcttcactg gggccttcct actgggctac 
ggagactctg tgttggtggt cagtgaggat 
cagggcagac tctactggag cgacctccag 
cgcctggagg acaccatcag gcaaaccagc 



gagcggcttt ggggtctatt ccagagagcg 60 
gtctaccagc gtgtggaagg cccccggaaa 120 
gaggaggggg cggagacatt ggcccacttc 180 
ctgggctcta gacccaggca gccaaacctc 240 
gccccctacc tggtcctgac ggccctgctg 300 
gtcgccttcc gagggtcctg ccaggcgtgc 360 
gtcaactatg agcctgacct ggatttccac 42 0 
gccatgttcc tgcagttcct gggggagggg 480 
cttcgggaac gggtggcagg ctcggccggg 540 
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atggccgctc tgactcagga cattcgcgcg gcgctctccc gccagaagct ggaccacgtg 600 
tggaccgaca cgcactacgt ggggctgcaa ttcccggatc cggctcaccc caacaccctg 660 
cactgggtcg atgaggccgg gaaggtcgga gagcagctgc cgctggagga ccctgacgtc 720 
tactgcccct acagcgccat cggcaacgtc acgggagagc tggtgtacgc ccactacggg 780 
cggcccgaag acctgcagga cctgcgggcc aggggcgtgg atccagtggg ccgcctgctg 840 
ctggtgcgcg tgggggtgat cagcttcgcc cagaaggtga ccaatgctca ggacttcggg 900 
gctcaaggag tgctcatata cccagagcca gcggacttct cccaggaccc acccaagcca 960 
agcctgtcca gccagcaggc agtgtatgga catgtgcacc tgggaactgg agacccctac 1020 
acacctggct tcccttcctt caatcaaacc cagttccctc cagttgcatc atcaggcctt 1080 
cccagcatcc cagcccagcc catcagtgca gacattgcct cccgcctgct gaggaagctc 1140 
aaaggccctg tggcccccca agaatggcag gggagcctcc taggctcccc ttatcacctg 1200 
ggccccgggc cacgactgcg gctagtggtc aacaatcaca ggacctccac ccccatcaac 1260 
aacatcttcg gctgcatcga aggccgctca gagccagatc actacgttgt catcggggcc 1320 
cagagggatg catggggccc aggagcagct aaatccgctg tggggacggc tatactcctg 1380 
gagctggtgc ggaccttttc ctccatggtg agcaacggct tccggccccg cagaagtctc 1440 
ctcttcatca gctgggacgg tggtgacttt ggaagcgtgg gctccacgga gtggctagaa 1500 
ggctacctca gcgtgctgca cctcaaagcc gtagtgtacg tgagcctgga caacgcagtg 1560 
ctgggggatg acaagtttca tgccaagacc agcccccttc tgacaagtct cattgagagt 1620 
gtcctgaagc aggtggattc tcccaaccac agtgggcaga ctctctatga acaggtggtg 1680 
ttcaccaatc ccagctggga tgctgaggtg atccggcccc tacccatgga cagcagtgcc 1740 
tattccttca cggcctttgt gggagtccct gccgtcgagt tctcctttat ggaggacgac 1800 
caggcctacc cattcctgca cacaaaggag gacacttatg agaacctgca taaggtgctg 1860 
caaggccgcc tgcccgccgt ggcccaggcc gtggcccagc tcgcagggca gctcctcatc 1920 
cggctcagcc acgatcgcct gctgcccctc gacttcggcc gctacgggga cgtcgtcctc 1980 
aggcacatcg ggaacctcaa cgagttctct ggggacctca aggcccgcgg gctgaccctg 2040 
cagtgggtgt actcggcgcg gggggactac atccgggcgg cggaaaagct gcggcaggag 2100 
atctacagct cggaggagag agacgagcga ctgacacgca tgtacaacgt gcgcataatg 216 0 
cgggtggagt tctacttcct ttcccagtac gtgtcgccag ccgactcccc gttccgccac 2220 
atcttcatgg gccgtggaga ccacacgctg ggcgccctgc tggaccacct gcggctgctg 22 80 
cgctccaaca gctccgggac ccccggggcc acctcctcca ctggcttcca ggagagccgt 2340 
ttccggcgtc agctagccct gctcacctgg acgctgcaag gggcagccaa tgcgcttagc 2400 
ggggatgtct ggaacattga taacaacttc tgaggccctg gggatcctca catccccgtc 2460 
ccccagtcaa gagctcctct gctcctcgct tgaatgattc agggtcaggg aggtggctca 2520 
gagtccacct ctcattgctg atcaatttct cattacccct acacatctct ccacggagcc 2580 
cagaccccag cacagatatc cacacacccc agccctgcag tgtagctgac cctaatgtga 2640 
cggtcatact gtcggttaat cagagagtag catcccttca atcacagccc cttccccttt 2700 
ctggggtcct ccatacctag agaccactct gggaggtttg ctaagccctg ggacctggcc 2760 
agctctgtta gtgggagaga tcgctggcac catagcctta tggccaacag gtggtctgtg 2820 
gtgaaagggg cgtggagttt caatatcaat aaaccacctg atatcaataa gccaaaa 2877 



<210> 3 
<211> 2519 
<212> DNA 

<213> human genome 



<400> 3 

gcgtccgcgg ggagcgctct tttcctaaac tcaggaaccc ctcgccgccc ctgcccctgg 60 
cgaccccacg tctctggcat ccttccctct tccctccctc tcctccgggc gcccaaaaaa 120 
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gtccccacct ctccccgctt aggcaaacca gccttcggga acgggtggca ggctcggccg 180 
ggatggccgc tctgactcag gacattcgcg cggcgctctc ccgccagaag ctggaccacg 240 
tgtggaccga cacgcactac gtggggctgc aattcccgga tccggctcac cccaacaccc 300 
tgcactgggt cgatgaggcc gggaaggtcg gagagcagct gccgctggag gaccctgacg 360 
tctactgccc ctacagcgcc atcggcaacg tcacgggaga gctggtgtac gcccactacg 420 
ggcggcccga agacctgcag gacctgcggg ccaggggcgt ggatccagtg ggccgcctgc 480 
tgctggtgcg cgtgggggtg atcagcttcg cccagaaggt gaccaatgct caggacttcg 540 
gggctcaagg agtgctcata tacccagagc cagcggactt ctcccaggac ccacccaagc 600 
caagcctgtc cagccagcag gcagtgtatg gacatgtgca cctgggaact ggagacccct 660 
acacacctgg cttcccttcc ttcaatcaaa cccagttccc tccagttgca tcatcaggcc 720 
ttcccagcat cccagcccag cccatcagtg cagacattgc ctcccgcctg ctgaggaagc 7 80 
tcaaaggccc tgtggccccc caagaatggc aggggagcct cctaggctcc ccttatcacc 840 
tgggccccgg gccacgactg cggctagtgg tcaacaatca caggacctcc acccccatca 900 
acaacatctt cggctgcatc gaaggccgct cagagccaga tcactacgtt gtcatcgggg 960 
cccagaggga tgcatgggcc ccaggagcag ctaaatccgc tgtggggacg gctatactcc 1020 
tggagctggt gcggaccttt tcctccatgg tgagcaacgg cttccggccc cgcagaagtc 1080 
tcctcttcat cagctgggac ggtggtgact ttggaagcgt gggctccacg gagtggctag 1140 
aaggctacct cagcgtgctg cacctcaaag ccgtagtgta cgtgagcctg gacaacgcag 1200 
tgctggggga tgacaagttt catgccaaga ccagccccct tctgacaagt ctcattgaga 1260 
gtgtcctgaa gcaggtggat tctcccaacc acagtgggca gactctctat gaacaggtgg 1320 
tgttcaccaa tcccagctgg gatgctgagg tgatccggcc cctacccatg gacagcagtg 1380 
cctattcctt cacggccttt gtgggagtcc ctgccgtcga gttctccttt atggaggacg 1440 
accaggccta cccattcctg cacacaaagg aggacactta tgagaacctg cataaggtgc 1500 
tgcaaggccg cctgcccgcc gtggcccagg ccgtggccca gctcgcaggg cagctcctca 1560 
tccggctcag ccacgatcgc ctgctgcccc tcgacttcgg ccgctacggg gacgtcgtcc 1620 
tcaggcacat cgggaacctc aacgagttct ctggggacct caaggcccgc gggctgaccc 1680 
tgcagtgggt gtactcggcg cggggggact acatccgggc ggcggaaaag ctgcggcagg 1740 
agate tacag ctcggaggag agagacgagc gactgacacg catgtacaac gtgcgcataa 1800 
tgcgggtgga gttctacttc ctttcccagt acgtgtcgcc agccgactcc ccgttccgcc 1860 
acatcttcat gggccgtgga gaccacacgc tgggcgccct gctggaccac ctgcggctgc 1920 
tgcgctccaa cagctccggg acccccgggg ccacctcctc cactggcttc caggagagcc 1980 
gtttccggcg tcagctagcc ctgctcacct ggacgctgca aggggcagcc aatgcgctta 2040 
gcggggatgt ctggaacatt gataacaact tctgaggccc tggggatcct cacatccccg 2100 
tcccccagtc aagagctcct ctgctcctcg cttgaatgat tcagggtcag ggaggtggct 2160 
cagagtccac ctctcattgc tgatcaattt ctcattaccc ctacacatct ctccacggag 2220 
cccagacccc agcacagata tccacacacc ccagccctgc agtgtagctg accctaatgt 2280 
gacggtcata ctgtcggtta atcagagagt agcatccctt caatcacagc cccttcccct 2340 
ttctggggtc ctccatacct agagaccact ctgggaggtt tgctaggccc tgggacctgg 2400 
ccagctctgt tagtgggaga gatcgctggc accatagcct tatggccaac aggtggtctg 246 0 
tggtgaaagg ggcgtggagt ttcaatatca ataaaccacc tgatatcaat aagccaaaa 2519 
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